Modeling and predicting the popularity of online contents with Cox proportional hazard regression model

نویسندگان

  • Jong Gun Lee
  • Sue B. Moon
  • Kavé Salamatian
چکیده

We propose a general framework which can be used for modeling and predicting the popularity of online contents. The aim of our modeling is not inferring the precise popularity value of a content, but inferring the likelihood with which the content will be popular. Our approach is rooted in survival analysis which deals with the survival time until an event of a failure or death. Survival analysis assumes that predicting the precise lifetime of an instance is very hard but predicting the likelihood of the lifetime of an instance is possible based on its hazard distribution. Additionally we position ourselves in the standpoint of an external observer who has to model the popularity of contents only with publicly available information. Thus, the goal of our proposed methodology is to model a certain popularity metric, such as the lifetime of a content and the number of comments which a content receives, with a set of explanatory factors, which are observable by the external observer. Among various parametric and non-parametric approaches for the survival analysis, we use the Cox proportional hazard regression model, which divides the distribution function of a certain popularity metric into two components: one which is explained by a set of explanatory factors, called risk factors, and another, a baseline survival distribution function, which integrates all the factors not taken into account. In order to validate our proposed methodology, we use two datasets crawled from two different discussion forums, forum.dpreview.com and forums.myspace.com, which are one of the largest discussion forum dealing various issues on digital cameras and a discussion forum provided by a representative social networks. We model two difference popularity metrics, the lifetime of threads and the number of comments, and we show that the models can predict the lifetime of threads from Dpreview (Myspace) by observing a thread during the first 5–6 days (24 h, respectively) and the number of comments of Dpreview threads by observing a thread during first 2–3 days. & 2011 Published by Elsevier B.V.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Penalized Estimators in Cox Regression Model

The proportional hazard Cox regression models play a key role in analyzing censored survival data. We use penalized methods in high dimensional scenarios to achieve more efficient models. This article reviews the penalized Cox regression for some frequently used penalty functions. Analysis of medical data namely ”mgus2” confirms the penalized Cox regression performs better than the cox regressi...

متن کامل

Life Duration of New Firms in Iranian Manufacturing Industries Using Cox Proportional Hazard Model

In this paper, the Cox proportional hazard model is used to answer several questions. In general, fourteen variables are applied in four groups: firm, industry, expenditure human resources specific characteristics as well. According to the previous literature in this field, the findings of this paper also show that the factors which affect life duration of firms are different between industries...

متن کامل

Comparison of Artificial Neural Networks and Cox Regression Models in Prediction of Kidney Transplant Survival

Cox regression model serves as a statistical method for analyzing the survival data, which requires some options such as hazard proportionality. In recent decades, artificial neural network model has been increasingly applied to predict survival data. This research was conducted to compare Cox regression and artificial neural network models in prediction of kidney transplant survival. The prese...

متن کامل

Comparison of Artificial Neural Networks and Cox Regression Models in Prediction of Kidney Transplant Survival

Cox regression model serves as a statistical method for analyzing the survival data, which requires some options such as hazard proportionality. In recent decades, artificial neural network model has been increasingly applied to predict survival data. This research was conducted to compare Cox regression and artificial neural network models in prediction of kidney transplant survival. The prese...

متن کامل

Evaluation of Survival Analysis Models for Predicting Factors Infuencing the Time of Brucellosis Diagnosis

Background:Brucellosis or Malta fever is one of the most common zoonotic diseases in the world. In addition to causing human suffering and dire economic impact on animals, due to the high prevalence of Brucellosis in the western regions of Isfahan province, this study aimed to analyze effective factors in the time of Brucellosis diagnosis using parametric and semi-parametric mo...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Neurocomputing

دوره 76  شماره 

صفحات  -

تاریخ انتشار 2012